Wavelet-Based Data Distortion for Privacy-Preserving Collaborative Analysis

Authors

  • Lian Liu
  • Jie Wang
  • Zhenmin Lin
  • Jun Zhang
Abstract

With the rapid development of modern data collection and data warehouse technologies, data mining is increasingly becoming standard practice. Accompanying this trend, preserving the privacy of sensitive data has become a challenge for data mining applications in many fields, especially medicine, finance, and homeland security. We present a class of novel privacy-preserving data distortion methods for collaborative analysis, based on the wavelet transformation, which, in addition to running fast, provide an effective balance between data utility and privacy protection. We also present a new privacy breach algorithm for collaborative analysis that could threaten data privacy, even when only distorted data values are available, in the single-basis wavelet transformation case. We therefore propose a multi-basis wavelet data distortion strategy for better privacy preservation in these situations. Based on experiments on real-life datasets, we conclude that the multi-basis wavelet data distortion method is a very promising privacy-preserving technique.

Technical Report No. 482-07, Department of Computer Science, University of Kentucky, Lexington, KY, 2007. The research work of the authors was supported in part by the National Science Foundation under grant CCF-0527967, in part by the National Institutes of Health under grant 1-R01-HL086644-01, in part by the Kentucky Science and Engineering Foundation under grant KSEF-148-502-06-186, and in part by the Alzheimer’s Association under grant NIGR-06-25460. Corresponding author. E-mail: [email protected]. URL: http://www.cs.uky.edu/ jzhang.

Similar resources

Wavelet-Based Data Distortion for Simultaneous Privacy-Preserving and Statistics-Preserving

With the rapid development of data mining technologies, preserving privacy in certain data becomes a challenge to data mining applications in many fields, especially in medical, financial and homeland security fields. We present a class of novel privacy-preserving data distortion methods in collaborative analysis situations based on wavelet transformation, to keep the data privacy and data stat...

On Random Additive Perturbation for Privacy Preserving Data Mining

Title of Thesis: On Random Additive Perturbation for Privacy Preserving Data Mining Author: Souptik Datta, Master of Science, 2004 Thesis directed by: Dr. Hillol Kargupta, Associate Professor Department of Computer Science and Electrical Engineering Privacy is becoming an increasingly important issue in many data mining applications. This has triggered the development of many privacy-preserving...

An Improved Privacy-Preserving Collaborative Filtering Recommendation Algorithm

Privacy-preserving collaborative filtering is an emerging web-adaptation tool for coping with the information overload problem without jeopardizing individuals’ privacy. However, collaborative filtering with privacy schemes commonly suffers from scalability and sparseness problems. Moreover, applying privacy measures distorts the collected data, which in turn degrades the accuracy of such systems. In this...

A Perturbation Method Based on Singular Value Decomposition and Feature Selection for Privacy Preserving Data Mining

In this study, a new model is provided for customized privacy in privacy-preserving data mining, in which the data owners define different levels of privacy for different features. Additionally, in order to improve perturbation methods, a method combining singular value decomposition (SVD) and feature selection is defined so as to benefit from the advantages of both domains. Also, to ...

P2P collaborative filtering with privacy

With the evolution of the Internet and e-commerce, collaborative filtering (CF) and privacy-preserving collaborative filtering (PPCF) have become popular. The goal in CF is to generate predictions with decent accuracy, efficiently. The main issue in PPCF, however, is achieving such a goal while preserving users’ privacy. Many implementations of CF and PPCF techniques proposed so far are central...

Publication date: 2007